Decentralized Planning in Stochastic Environments with Submodular Rewards

Authors

Rajiv Ranjan Kumar

Pradeep Varakantham

Akshat Kumar

Abstract

The Decentralized Markov Decision Process (Dec-MDP) provides a rich framework for representing cooperative, decentralized, stochastic planning problems under transition uncertainty. However, solving a Dec-MDP to generate coordinated yet decentralized policies is NEXP-hard. Researchers have made significant progress on approximate approaches that improve scalability with respect to the number of agents. However, little or no research has been devoted to guarantees on solution quality for approximate approaches with more than two agents. The situation is similar for the competitive decentralized planning problem and the Stochastic Game (SG) model. To address this, we identify models in the cooperative and competitive cases that rely on submodular rewards, for which we show that existing approximate approaches can provide strong quality guarantees (a priori and, for the cooperative case, also a posteriori). We then provide solution approaches and demonstrate improved online guarantees on benchmark problems from the literature for the cooperative case.
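As an illustration of the submodularity property the abstract relies on, the following minimal sketch checks diminishing returns for a toy coverage reward. It is not taken from the paper: the coverage reward, the agent names, and the helper functions `coverage_reward`, `marginal_gain`, and `is_submodular` are all hypothetical and chosen only to make the definition concrete.

```python
from itertools import combinations

def coverage_reward(agent_sets, chosen):
    """Toy joint reward (an assumption): number of distinct targets covered."""
    covered = set()
    for agent in chosen:
        covered |= agent_sets[agent]
    return len(covered)

def marginal_gain(agent_sets, chosen, agent):
    """Extra reward obtained by adding `agent` to the set `chosen`."""
    with_agent = coverage_reward(agent_sets, chosen | {agent})
    return with_agent - coverage_reward(agent_sets, chosen)

def is_submodular(agent_sets):
    """Brute-force check of diminishing returns over all nested subset pairs.

    For every A subset of B and every agent not in B, the gain of adding the
    agent to A must be at least its gain when added to B.
    Exponential in the number of agents, so only suitable for tiny examples.
    """
    agents = list(agent_sets)
    subsets = [frozenset(c)
               for r in range(len(agents) + 1)
               for c in combinations(agents, r)]
    for small in subsets:
        for large in subsets:
            if not small <= large:
                continue
            for agent in agents:
                if agent in large:
                    continue
                if (marginal_gain(agent_sets, small, agent)
                        < marginal_gain(agent_sets, large, agent)):
                    return False
    return True

# Hypothetical example: three agents, each able to cover some targets.
agent_sets = {"a1": {1, 2, 3}, "a2": {3, 4}, "a3": {4, 5, 6}}

print(is_submodular(agent_sets))
print(marginal_gain(agent_sets, frozenset(), "a2"))
print(marginal_gain(agent_sets, frozenset({"a1", "a3"}), "a2"))
```

In this toy instance, agent `a2` contributes less once `a1` and `a3` are already included, which is the diminishing-returns behavior that makes quality guarantees for approximate methods possible.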

Similar resources

Optimizing decentralized production–distribution planning problem in a multi-period supply chain network under uncertainty

Decentralized supply chain management is found to be significantly relevant in today’s competitive markets. Production and distribution planning is posed as an important optimization problem in supply chain networks. Here, we propose a multi-period decentralized supply chain network model with uncertainty. The imprecision related to uncertain parameters like demand and price of the final produc...

Learning for Multiagent Decentralized Control in Large Partially Observable Stochastic Environments

This paper presents a probabilistic framework for learning decentralized control policies for cooperative multiagent systems operating in a large partially observable stochastic environment based on batch data (trajectories). In decentralized domains, because of communication limitations, the agents cannot share their entire belief states, so execution must proceed based on local information. D...

Stochastic Decision Making in Manufacturing Environments

Decision making plays an important role in economics, psychology, philosophy, mathematics, statistics and many other fields. In each field, decision making consists of identifying the values, uncertainties and other issues that define the decision. In any field, the nature of the decisions is affected by environmental characteristics. In this paper, we consider the production planning pro...

Informative path planning as a maximum traveling salesman problem with submodular rewards

In this paper we extend the classic problem of finding the maximum weight Hamiltonian cycle in a graph to the case where the objective is a submodular function of the edges. We consider a greedy algorithm and a 2-matching based algorithm, and we show that they have approximation factors of 1/(2+κ) and max{2/(3(2+κ)), (2/3)(1−κ)} respectively, where κ is the curvature of the submodular function. Bo...

Efficient, optimal stochastic-action selection when limited by an action budget

The problem that we consider here is a basic operations research problem, but it is also a special case of the Stochastic Shortest Path with Recourse Problem and the Canadian Travellers Problem in the probabilistic path planning literature, and it is also a special case of maximizing a submodular set function subject to a matroid constraint. Specifically, suppose an agent has a task and suppose th...

Publication date: 2017
